Demographic Breakdown of Twitter Users: An analysis based on names
Abstract
We propose an approach for estimating age solely from people’s first names, extending an existing method proposed by Chang et al. for ethnicity estimation. We demonstrate that the proposed method predicts both the age of an individual and the age breakdown of an entire population better than the natural alternatives. We then apply both the age and the ethnicity methods to US Twitter users and perform what is, to the best of our knowledge, the largest demographic analysis of the platform. First, we closely replicate the findings on Twitter demographics in the most recent Pew Research report, suggesting that names can be a useful indicator, especially for aggregate analysis. Second, we demonstrate that our approach can overcome a methodological limitation of the Pew Research study by estimating the breakdown for all age groups, including users under 18 years old. Third, we find that the US Twitter user base has always been diverse, though some demographic groups are over-represented and others under-represented relative to general internet users. We also find strong evidence that demographic groups, in terms of both age and ethnicity, differ in how they use the platform: in their following relationships, their topical conversations, and the times of day at which they are active.
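The abstract does not spell out the estimator, so the following is only a minimal sketch of the general idea of name-based age estimation, not the authors' method. It assumes a table of births per first name and birth year is available; the counts in `BIRTHS`, the age-group boundaries, and all function names are hypothetical, and the sketch ignores mortality, which a real estimator would need to account for.

```python
from collections import defaultdict

# Hypothetical toy counts of births per (first name, birth year). Not real data.
BIRTHS = {
    "Mildred": {1930: 900, 1960: 90, 1990: 10},
    "Jennifer": {1930: 10, 1960: 300, 1990: 690},
}

# Assumed age groups (inclusive bounds); the paper's own grouping may differ.
AGE_GROUPS = [(0, 17), (18, 29), (30, 49), (50, 64), (65, 200)]


def age_group(age):
    """Return the (low, high) group that contains the given age."""
    for low, high in AGE_GROUPS:
        if low <= age <= high:
            return (low, high)
    raise ValueError(f"age out of range: {age}")


def age_distribution(name, current_year, births=BIRTHS):
    """Estimate P(age group | first name) from birth counts.

    Returns None when the name is unknown. Mortality is ignored.
    """
    counts = births.get(name)
    if not counts:
        return None
    # Keep only birth years that are not in the future.
    valid = {year: n for year, n in counts.items() if year <= current_year}
    total = sum(valid.values())
    if total == 0:
        return None
    dist = defaultdict(float)
    for year, n in valid.items():
        dist[age_group(current_year - year)] += n / total
    return dict(dist)


def population_breakdown(names, current_year, births=BIRTHS):
    """Average the per-name distributions over all names that can be matched."""
    totals = defaultdict(float)
    matched = 0
    for name in names:
        dist = age_distribution(name, current_year, births)
        if dist is None:
            continue  # unknown names are skipped, not guessed
        matched += 1
        for group, p in dist.items():
            totals[group] += p
    if matched == 0:
        return {}
    return {group: t / matched for group, t in totals.items()}
```

Calling `population_breakdown(["Mildred", "Jennifer", "Zzyzx"], 2015)` averages the two known names and skips the unknown one. Averaging full distributions, rather than assigning each person a single most likely age, is what makes this kind of estimate better suited to aggregate analysis than to individual prediction.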
Similar resources
Detection of Twitter Users' Attitudes about Flu Vaccine based on the Content and Sentiment Analysis of the Sent Tweets
Introduction: The influenza vaccine is one of the controversial challenges in today's societies. Considering the importance of using the flu vaccine in preventing the spread of influenza virus, the Twitter network, as a rich source of data, provides suitable conditions for research in this field to examine the attitudes of different people about this vaccine. The results in one hand will help h...
A Comparative Study of Demographic Attribute Inference in Twitter
Social media platforms have become a major gateway to receive and analyze public opinions. Understanding users can provide invaluable context information of their social media posts and significantly improve traditional opinion analysis models. Demographic attributes, such as ethnicity, gender, age, among others, have been extensively applied to characterize social media users. While studies ha...
Design and Test of the Real-time Text mining dashboard for Twitter
One of today's major research trends in the field of information systems is the discovery of implicit knowledge hidden in datasets that are currently being produced at high speed, in large volumes, and in a wide variety of formats. Data with such features is called big data. Extracting, processing, and visualizing huge amounts of data has today become one of the concerns of data science scholar...
A High-Performance Model based on Ensembles for Twitter Sentiment Classification
Background and Objectives: Twitter Sentiment Classification is one of the most popular fields in information retrieval and text mining. Millions of people around the world use social networks like Twitter intensively. Twitter lets users publish tweets to say what they think about topics. There are numerous web sites built on the Internet presenting Twitter. The user can enter a sentiment ta...